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Abstract. We investigate the global GRAMMAR constraint over restricted classes 
of context free grammars like deterministic and unambiguous context-free gram- 
mars. We show that detecting disentailment for the GRAMMAR constraint in these 
cases is as hard as parsing an unrestricted context free grammar. We also consider 
the class of linear grammars and give a propagator that runs in quadratic time. Fi- 
nally, to demonstrate the use of linear grammars, we show that a weighted linear 
Grammar constraint can efficiently encode the EditDistance constraint, and 
a conjunction of the EditDistance constraint and the REGULAR constraint. 



1 Introduction 

In domains like staff scheduling, regulations can often be naturally expressed using 
formal grammars. Pesant [9] introduced the global REGULAR constraint to express 
problems using finite automaton. Sellmann [14] and Quimper and Walsh [11] then in- 
troduced the global GRAMMAR constraint for context-free grammars. Unlike parsing 
which only finds a single solution, propagating such a constraint essentially consid- 
ers all solutions. Nevertheless, a propagator for the REGULAR constraint runs in linear 
time and a propagator for the GRAMMAR constraint runs in cubic time just like the cor- 
responding parsers. Subsequently, there has been research on more efficient propaga- 
tors for these global constraints [12, 10, 4, 7, 6]. Whilst research has focused on regular 
and unrestricted context-free languages, a large body of work in formal language the- 
ory considers grammars between regular and context-free. Several restricted forms of 
context-free grammars have been proposed that permit linear parsing algorithms whilst 
being more expressive than regular grammars. Examples of such grammars are LL(k), 
LR(1), and LALR. Such grammars play an important role in compiler theory. For in- 
stance, yacc generates parsers that accept LALR languages. 

In this paper we explore the gap between the second and third levels of the Chom- 
sky hierarchy for classes of grammars which can be propagated more efficiently than 
context-free grammars. These classes of grammar are attractive because either they 
have a linear or quadratic time membership test (e.g., LR(k) and linear grammars, re- 
spectively) or they permit counting of strings of given length in polynomial time (e.g., 
unambiguous grammars). The latter may be useful in branching heuristics. One of our 
main contributions is a lower bound on the time complexity for propagating grammar 
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constraints using deterministic or unambiguous grammars. We prove that detecting dis- 
entailment for such constraints has the same time complexity as the best parsing algo- 
rithm for an arbitrary context-free grammar. Using LL(k) languages or unambiguous 
grammars does not therefore improve the efficiency of propagation. Another contri- 
bution is to show that Unearity of the grammar permits quadratic time propagation. 
We show that we can encode an EditDistance constraint and a combination of an 
EditDistance constraint and a Regular constraint using such a linear grammar. 
Experimental results show that this encoding is very efficient in practice. 

2 Background 

A context-free grammar is a tuple G = {N, T, P, S), where is a finite set of non- 
terminal symbols, T is a finite set of terminal symbols, P is a set of productions, and 
S G A?^ is the start symbol. A production is of the form A ^ a where A G N and 
a e (A^ U T)"*". The derivation relation induced by G is defined as follows: for any 
u,v £ {Nut)*, uAv =>g uav if there exists a production A ^ ainP. Sometimes we 
additionally index =^>g by the production that is applied. The transitive, reflexive closure 
of =>G is denoted by A string in s £ T* is generated by G if S" =>q s. The set 
of all strings generated by G is denoted L{G). Note that this does not allow the empty 
string £ as right-hand side of a production. Hence, e ^ ^{G). This is not a restriction: 
we can add a new start symbol Z with productions Z ^ e \ S \.o our grammars. Our 
results can easily be generahzed to such £-enriched grammars. We denote the length of 
a string s by |s|. The size of G, denoted by |G|, is X^A^aGP 

A context-free grammar is in Chomsky form if all productions are of the form A 
BC where B and C are non-terminals or A — > a where a is a terminal. Any e-free 
context-free grammar G can be converted to an equivalent grammar G' in Chomsky 
form with at most a linear increase in its size; in fact, \G'\ < 3|G|, see Section 4.5 
in [3]. A context-free grammar is in Greibach form if all productions are of the form 
A — > oa where a is a terminal and a is a (possibly empty) sequence of non-terminals. 
Any context-free grammar G can be converted to an equivalent grammar G' in Greibach 
form with at most a polynomial increase in its size; in fact, the size of G' is in 0(|G|"^) 
in general, and is in 0(|Gp) if G has no chain productions of the form A ^ B for 
nonterminals A, B, see [1]. A context-free grammar is regular if all productions are of 
the forms A — > w or A — > wB for non-terminals A, B and w G T+. 

3 Simple Context-Free Grammars 

In this section we show that propagating a simple context-free grammar constraint is at 
least as hard as parsing an (unrestricted) context-free grammar. A grammar G is simple 
if it is in Greibach form, and for every non-terminal A and terminal a there is at most 
one production of the form A — > aa. Hence, restricting ourselves to languages recog- 
nized by simple context-free grammars does not improve the complexity of propagating 
a global grammar constraint. Simple context-free languages are included in the deter- 
ministic context-free languages (characterized by deterministic push-down automata), 
and also in the LL{1) languages [13], so this result also holds for propagating these 



classes of languages. Given finite sets Di,. . . , Dn, their Cartesian product language 
L{Rdi,...,d„) is the cross product of the domains {aia2 • • • fln I G £>i, . . . , a„ G 
Following [14], we define the global GRAMMAR constraint: 

Definition 1. The Grammar([Xi. . . . , X„], G) constraint is true for an assignment 

variables X iff a string fonned by this assignment belongs to L{G). 

From Definition 1, we observe that finding a support for the grammar constraint is 
equivalent to intersecting the context-free language with the Cartesian product language 
of the domains. 

Proposition 1. Let G be a context-free grammar, Xi, . . . , X„ be a sequence of vari- 
ables with domains D[Xi), . . . Then L{G) D L{Rd(^Xi),...,d{x^)) 7^ iff 
Grammar([Xi, . . . , Xn], G) has a support. 

Context-free grammars are effectively closed under intersection with regular gram- 
mars. To see this, consider a context-free grammar G in Chomsky form and a reg- 
ular grammar R. Following the "triple construction", the intersection grammar has 
non-terminals of the form (F, A, F') where F, F' are non-terminals of R and ^ is a 
non-terminal of G. Intuitively, (K A, F') generates strings ?i> that are generated by A 
and also by F, through a derivation from F to wF' . If A ^ BC is a production of 
G, then we add, for all non-terminals F, F', F" of R, the production (F, A,F") 
{F, B, F'){F' , G, F"). The resulting grammar is 0(\G\rt') in size where n is the num- 
ber of non-terminals of R. This is similar to the construction of Theorem 6.5 in [3] 
which uses push-down automata instead of grammars. Since emptiness of context-free 
grammars takes linear time (cf. [3]) we obtain through Proposition 1 a cubic time algo- 
rithm to check whether a global constraint Grammar( [Xi , . . . , Xn] , G) has support. 
In fact, this shows that we can efficiently propagate more complex constraints, such as 
the conjunction of a context-free with a regular constraint. Note that if i? is a Carte- 
sian product language then the triple construction generates the same result as the CYK 
based propagator for the Grammar constraint [14, 1 1]. 

We now show that for simple context-free grammars G, detecting disentailment of 
the constraint Grammar([Xi , . . . , X^] , G), i.e. testing whether it has a solution, is at 
least as hard as parsing an arbitrary context-free grammar. 

Theorem 1. Let G be a context-free grammar in Greibach form and s a string 
of length n. One can construct in 0{\G\) time a simple context-free grammar G' 
and in 0{\G\n) time a Cartesian product language L{Ru(^Xi),...,D{x„)) such that 

L{G)nL{RDix,)....Mx„))^9iffseL{G). 

Proof. The idea behind the proof is to determinize an unrestricted context free gram- 
mar G by mapping each terminal in G to a set of pairs - the terminal and a pro- 
duction that can consume this terminal. This allows us to carry information about 
the derivation inside a string in G'. Then, we construct a Cartesian product language 
L{Rd{Xi)....,d{x„)) over these pairs so that all strings from this language map only to 
the string s. Let G = {N, T, P, S) and fix an arbitrary order of the productions in P. 
We now construct the grammar G' = {N, T', P' , S). For every 1 < j < if the j-th 
production of Pis A ^ aa then let (a, j) be a new symbol in T' and let the production 



A — > {a,j)a be in P'. Next, we construct the Cartesian product language. We define 
D{Xi) — {{a,j)\{si = a) A {a,j) G T'}, i = 1, . . . ,n and Si is the i-th letter of s. 
Clearly, G' is constructed in 0(|G|) time and i^(-R£)(Xi),...,£)(x„)) in time. 

(=^>) Let L{G') n i(-RD(Xi),...,D(x„)) be non empty. Then there exits a string s' 
that belongs to the intersection. Let s' = (ai,ii) • • • (a„,i„). By the definition of 
L{RD{Xi),...,D{Xn))' the string aia2 ■ ■ - an must equal s. Since s' £ L{G'), there must 
be a derivation by G of the form 

S =>G,pi aiOi =>G,p2 "1020;' • • • =^G,p„ ai • • • a„ 

where is the j-th production in P. Hence, .s e L{G). 

(<^=) Let s € -L(G). Consider a derivation sequence of the string s. We replace every 
symbol a in s that was derived by the i-th production of G by (a, i). By the construction 
of G', the string s' is in L{G'). Moreover, s' is also in L{Rd{Xi),...,d(x„))- □ 

Note that context-free parsing has a quadratic time lower bound, due to its connec- 
tion to matrix multiphcation [8]. Given this lower bound and the fact that the construc- 
tion of Theorem 1 requires only linear time, we can deduce the following. 

Corollary 1. Let G be a context-free grammar. If G is simple { or deterministic or 
LL{1)) then detecting disentailment o/Gramm AR ( [Xi , X„] , G) is at least as hard 
as context-free parsing of a string of length n. 

We now show the converse to Theorem 1 which reduces intersection emptiness 
of a context-free with a regular grammar, to the membership problem of context- 
free languages. This shows that the time complexity of detecting disentailment for the 
Grammar constraint is the same as the time complexity of the best parsing algorithm 
for an arbitrary context free grammar. Therefore, our result shows that detecting dis- 
entailment takes 0(n^^) time [2J, as in the best known algorithm for Boolean matrix 
multiphcation. It does not, however, improve the asymptotic complexity of a domain 
consistency propagator for the Grammar constraint [14, 1 1]. 

Theorem 2. LetG = {N,T^ P. S) be acontext-free grammar and L{R[)i^Xi),....d(x,,)) 
be Cartesian product language. One can construct in time 0{\G\ a context- 

free grammar G' and in time 0{n\T\) a string s such that s € L{G') iff L{G) fl 

Proof. (Sketch) We assign an index to each terminal in T. For each position i of the 
strings of R, we create a bitmap of the alphabet that describes the terminals that may 
appear in that position. The j-th bit of the bitmap is 1 iff the symbol with index j may 
appear at position i. The string s is the concatenation of the bitmaps for each position 
and has size n\T\. First, we add B ^ Q and -B ^ 1 to G'. For each terminal in T with 
index j, we introduce Tj — »■ B^~^\B^'^^~^ into G' to accept any bitmap with 1 at the 
j'-th position. Then, for each production in G of the form A ^ aa such that the index 
of a is j, we add A Tja to G'. In this construction, every production in G' except 
for those with Tj on the left hand side can be uniquely mapped to a production in G. It 
can be shown that s G L{G') iff L(G) n L{Rd(Xi),...,d{x„)) ¥=1>- □ 



4 Linear Context-Free Grammars 



A context-free grammar is linear if every production contains at most one non-terminal 
in its right-hand side. The Unear languages are a proper superset of the regular lan- 
guages and are a strict subset of the context-free languages. Linear context-free gram- 
mars possess two important properties: (1) membership of a given string of length n 
can be checked in time O(n^) (see Theorem 12.3 in [15]), and (2) the class is closed 
under intersection with regular grammars (to see this, apply the "triple construction" as 
explained after Proposition 1). The second property opens the possibihty of construct- 
ing a polynomial time propagator for a conjunction of the the linear Grammar and the 
Regular constraints. Interestingly, we can show that a CYK-based propagator for this 
type of grammars runs in quadratic time. This is then the third example of a grammar, 
besides regular and context-free grammars, where the asymptotic time complexity of 
the parsing algorithm and that of the corresponding propagator are equal. 

Theorem 3. Let G be a linear grammar and Grammar([Xi, . . . , Xn], G) be the cor- 
responding global constraint. There exists a domain consistency propagator for this 
constraint that runs in 0{n^\G\) time. 

Proof. We convert G = {N, T, P, S) into CNF. Every linear grammar can be converted 
into the form A — > aB, A — > Ba and A ^ a, where a,b G T and A,BgN (see 
Theorem 12.3 of [15]) in 0(|G|) time. To obtain CNF we replace every terminal a G T 
that occurs in a production on the right hand side with a new non-terminal Ya and 
introduce a production ^ a. 

Consider the CYK-based domain consistency propagator for an arbitrary context- 
free grammar constraint [1 1, 14]. The algorithm runs in two stages. In the first stage, it 
constructs in a bottom-up fashion a dynamic programing table V^xn, where an element 
A of Vij is a potential non-terminal that generates a substring from the domains of 
variables [Xi, . . . , Xi+j]. In the second stage, it performs a top-down traversal of V and 
marks an element A of Vij iff it is reachable from the starting non-terminal S using 
productions of the grammar and elements of V. It then removes unmarked elements, 
including terminals. If it removes a terminal at column i of the table, it prunes the 
corresponding value of variable Xi. 

The complexity of this algorithm is bounded by the number of possible 1-step 
derivations from each non-terminal in the table. Let G' = {N' ,T' , P' , S') be an 
arbitrary context free grammar. There are 0{\N'\n'^) non-terminals in the table and 
each non-terminal can be expanded in 0(F' {A)n) possible ways, where F'{A) is the 
number of productions in G" with non-terminal A on the left-hand side. Therefore, 
the total time complexity of the propagator for unrestricted context-free grammars is 

J^AeN' nF'{A) = 0(n'^|G"|). In contrast, the number of possible 1-step deriva- 
tions from each non-terminal in linear grammars is bounded by 0{F{A)). Therefore, 
the propagator runs in O (n^ | G |) f or a hnear grammar G. □ 

Theorem 3 can be extended to the weighted form of the linear Grammar con- 
straint, WeightedCFG [5]. A weighted grammar is annotated with a weight for 
each production and the weight of a derivation is the sum of all weights used in it. 
The hnear WeightedCFG(G, Z,[Xi,.. ., X„] ) constraint holds iff an assignment X 



foms a string belonging to the weighted linear grammar G and the minimal weight 
derivation of X is less than or equal to Z. The domain consistency propagator for the 
WeightedCFG constraint is an extension of the propagator for Grammar that com- 
putes additional information for each non-terminal A e Vij — ^the minimum and the 
maximum weight derivations from A. Therefore, this algorithm has the same time and 
space asymptotic complexity as the propagator for Grammar, so the complexity anal- 
ysis for the linear WeightedCFG constraint is identical to the non-weighted case. 

It is possible to restrict linear grammars further, so that the resulting global con- 
straint problem is solvable in linear time. As an example, consider "fixed-growth" 
grammars in which there exists I and r with Z -I- r > 1 such that every production 
is of the form either A — > w G T+ or A — > uBw where the length of m G T* equals I 
and the length of w e T* equals r. In this case, the triple construction (explained below 
Proposition 1) generates 0{\G\n) new non-terminals implying linear time propagation 
(similarly, CYK runs in linear time as it only generates non-terminals on the diagonal of 
the dynamic program). A special case of fixed- growth grammars are regular grammars 
which have I = 1 and r = (or vice versa). 

5 The EditDistance Constraint 

To illustrate linear context-free grammars, we show how to encode an edit distance 
constraint into such a granmnar. EditDistance ([Xi, . . . , X„, Fi, . . . , Ym],N) holds 
iff the edit distance between assignments of two sequences of variables X and Y is less 
than or equal to N. The edit distance is the minimum number of deletion, insertion 
and substitution operations required to convert one string into another. Each of these 
operations can change one symbol in a string. W.L.O.G. we assume that n = m. We will 
show that the EditDistance constraint can be encoded as a Unear WeightedCFG 
constraint. The idea of the encoding is to parse matching substrings using productions 
of weight and to parse edits using productions of weight 1 . 

We convert EditDistance([X, Y], A?^) into a Unear 
WeightedCFG([Z2„+i, A/', Ged) constraint. The first n variables in the se- 
quence Z are equal to the sequence X, the variable Z„+i is ground to the sentinel 
symbol # so that the grammar can distinguish the sequences X and Y, and the last 
n variables of the sequence Z are equal to the reverse of the sequence Y. We define 
the linear weighted grammar Ged as follows. Rules S dSd with weight w = 0, 
Vd e D{X) U D(Y), capture matching terminals, rules S — » d\Sd2 with w = 1, 
Vdi e D{X),d'2 G D{Y),d-i ^ ^2, capture replacement, rules 5 — > dS\Sd v/ith 
w = 1, Vd G D{X), capture insertions and deletions. Finally, the rule S* ^ # with 
weight w = generates the sentinel symbol. As discussed in the previous section, the 
propagator for the linear WeightedCFG constraint takes 0(n^|G|) time. Down a 
branch of the search tree, the time complexity is 0(n'^\G\ub{N)). 

We can use this encoding of the EditDistance constraint into a Unear 
WeightedCFG constraint to construct propagators for more complex constraints. For 
instance, we can exploit the fact that linear grammars are closed under intersection 
with regular grammars to propagate efficiently the conjunction of an EditDistance 
constraint and Regular constraints on each of the sequences X,Y. More formaUy, 



let X and Y be two sequences of variables of length n subject to the constraints 
REGULAR(X,i?i), Regular (Y,i?2) and EditDistance(X, Y, iV). We construct a 
domain consistency propagator for the conjunction of these three constraints, by com- 
puting a grammar that generates strings of length 2n + 1 which satisfy the conjunc- 
tion. First, we construct an automaton that accepts C{Ri)^C(R2)^ ■ This language 
is regular and requires an automaton of size 0(|-Ri | + \R2\)- Second, we intersect this 
with the linear weighted grammar that encodes the EditDistance constraint using the 
"triple construction". The size of the obtained grammar is = |Ged|(|-Ri| -|- |-R2|)^ 
and this grammar is a weighted hnear grammar. Therefore, we can use the Unear 
WeightedCFG(Z, N, Ga) constraint to encode the conjunction. Note that the size of 
Ga is only quadratic in | J?i | -|- 1 ii2 1 , because Ged is a linear grammar. The time complex- 
ity to enforce domain consistency on this conjunction of constraints is 0(ri^|GA|) = 
0{n^£{\Ri\ + |i?2|)^)foreachinvocationandO(n2(i2(|i?i| + \R2\)^ub{N)) downa 
branch of the search tree. 



Table 1. Performance of the encoding into WeightedCFG constraints shown in: number of 
instances solved in 60 sec / average number of choice points / average time to solve. 
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To evaluate the performance of the WeightedCFG(Z, A^, Ga) constraint we 
carried out a series of experiments on random problems. In out first model the 
conjunction of the EditDistance constraint and two Regular constraints was 
encoded with a single WeightedCFG(Z, A^, Ga) constraint. We call this model 
ED^. The second model contains the EditDistance constraint, encoded as 
WeightedCFG(Z, iV, Ged), and two Regular constraints. The Regular con- 
straint for the model ED Dec is implemented using a decomposition into ternary table 
constraints [11]. The WeightedCFG constraint is implemented with an incremental 
monolithic propagator [5]. The first Regular constraint ensures that there are at most 
two consecutive values one in the sequence. The second encodes a randomly generated 
string of Os and Is. To make problems harder, we enforced the EditDistance con- 



straint and the Regular constraints on two sequences X#(Y)^ and X'#(Y')^ of the 
same length 2n + 1. The EditDistance constraint and the first Regular constraint 
are identical for these two sequences, while Y and Y' correspond to different randomly 
generated strings of Os and Is. Moreover, X and X' overlap on 15% of randomly chosen 
variables. For each possible value of n G {15, 20, 25, 30, 35, 40, 45, 50}, we generated 
100 instances. Note that n is the length of each sequence X, Y, X' and Y'. N is the 
maximum edit distance between X and Y and between X' and Y'. We used a random 
value and variable ordering and a time out of 60 sec. Results for different values of n 
are presented in Table 1 . As can be seen from the table, the model ED/^ significantly 
outperforms the model ED Dec for larger problems, but it is slightly slower for smaller 
problems. Note that the model ED/^ solves many more instances compared to ED Dec- 

6 Conclusions 

Unlike parsing, restrictions on context free grammars such as determinism do not im- 
prove the efficiency of propagation of the corresponding global GRAMMAR constraint. 
On the other hand, one specific syntactic restriction, that of linearity, allows propaga- 
tion in quadratic time. We demonstrated an apphcation of such a restricted grammar in 
encoding the EditDistance constraint and more complex constraints. 
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